A Method for Text Localization and Recognition in Real-World Images
Identifieur interne : 000524 ( Main/Exploration ); précédent : 000523; suivant : 000525A Method for Text Localization and Recognition in Real-World Images
Auteurs : Lukas Neumann [République tchèque] ; Jiri Matas [République tchèque]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2011.
Abstract
Abstract: A general method for text localization and recognition in real-world images is presented. The proposed method is novel, as it (i) departs from a strict feed-forward pipeline and replaces it by a hypotheses-verification framework simultaneously processing multiple text line hypotheses, (ii) uses synthetic fonts to train the algorithm eliminating the need for time-consuming acquisition and labeling of real-world training data and (iii) exploits Maximally Stable Extremal Regions (MSERs) which provides robustness to geometric and illumination conditions. The performance of the method is evaluated on two standard datasets. On the Char74k dataset, a recognition rate of 72% is achieved, 18% higher than the state-of-the-art. The paper is first to report both text detection and recognition results on the standard and rather challenging ICDAR 2003 dataset. The text localization works for number of alphabets and the method is easily adapted to recognition of other scripts, e.g. cyrillics.
Url:
DOI: 10.1007/978-3-642-19318-7_60
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000217
- to stream Istex, to step Curation: 000214
- to stream Istex, to step Checkpoint: 000181
- to stream Main, to step Merge: 000530
- to stream Main, to step Curation: 000524
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">A Method for Text Localization and Recognition in Real-World Images</title>
<author><name sortKey="Neumann, Lukas" sort="Neumann, Lukas" uniqKey="Neumann L" first="Lukas" last="Neumann">Lukas Neumann</name>
</author>
<author><name sortKey="Matas, Jiri" sort="Matas, Jiri" uniqKey="Matas J" first="Jiri" last="Matas">Jiri Matas</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:285364B6623C7301C6B9380A708BED60EE238BBF</idno>
<date when="2011" year="2011">2011</date>
<idno type="doi">10.1007/978-3-642-19318-7_60</idno>
<idno type="url">https://api.istex.fr/document/285364B6623C7301C6B9380A708BED60EE238BBF/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000217</idno>
<idno type="wicri:Area/Istex/Curation">000214</idno>
<idno type="wicri:Area/Istex/Checkpoint">000181</idno>
<idno type="wicri:doubleKey">0302-9743:2011:Neumann L:a:method:for</idno>
<idno type="wicri:Area/Main/Merge">000530</idno>
<idno type="wicri:Area/Main/Curation">000524</idno>
<idno type="wicri:Area/Main/Exploration">000524</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">A Method for Text Localization and Recognition in Real-World Images</title>
<author><name sortKey="Neumann, Lukas" sort="Neumann, Lukas" uniqKey="Neumann L" first="Lukas" last="Neumann">Lukas Neumann</name>
<affiliation wicri:level="3"><country xml:lang="fr">République tchèque</country>
<wicri:regionArea>Center for Machine Perception, Czech Technical University, Prague</wicri:regionArea>
<placeName><settlement type="city">Prague</settlement>
<region type="région" nuts="2">Bohême centrale</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Matas, Jiri" sort="Matas, Jiri" uniqKey="Matas J" first="Jiri" last="Matas">Jiri Matas</name>
<affiliation wicri:level="3"><country xml:lang="fr">République tchèque</country>
<wicri:regionArea>Center for Machine Perception, Czech Technical University, Prague</wicri:regionArea>
<placeName><settlement type="city">Prague</settlement>
<region type="région" nuts="2">Bohême centrale</region>
</placeName>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2011</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">285364B6623C7301C6B9380A708BED60EE238BBF</idno>
<idno type="DOI">10.1007/978-3-642-19318-7_60</idno>
<idno type="ChapterID">60</idno>
<idno type="ChapterID">Chap60</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: A general method for text localization and recognition in real-world images is presented. The proposed method is novel, as it (i) departs from a strict feed-forward pipeline and replaces it by a hypotheses-verification framework simultaneously processing multiple text line hypotheses, (ii) uses synthetic fonts to train the algorithm eliminating the need for time-consuming acquisition and labeling of real-world training data and (iii) exploits Maximally Stable Extremal Regions (MSERs) which provides robustness to geometric and illumination conditions. The performance of the method is evaluated on two standard datasets. On the Char74k dataset, a recognition rate of 72% is achieved, 18% higher than the state-of-the-art. The paper is first to report both text detection and recognition results on the standard and rather challenging ICDAR 2003 dataset. The text localization works for number of alphabets and the method is easily adapted to recognition of other scripts, e.g. cyrillics.</div>
</front>
</TEI>
<affiliations><list><country><li>République tchèque</li>
</country>
<region><li>Bohême centrale</li>
</region>
<settlement><li>Prague</li>
</settlement>
</list>
<tree><country name="République tchèque"><region name="Bohême centrale"><name sortKey="Neumann, Lukas" sort="Neumann, Lukas" uniqKey="Neumann L" first="Lukas" last="Neumann">Lukas Neumann</name>
</region>
<name sortKey="Matas, Jiri" sort="Matas, Jiri" uniqKey="Matas J" first="Jiri" last="Matas">Jiri Matas</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000524 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000524 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:285364B6623C7301C6B9380A708BED60EE238BBF |texte= A Method for Text Localization and Recognition in Real-World Images }}
This area was generated with Dilib version V0.6.32. |